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SEQUENCE LISTING 





(1) GENERAL INFORMATION: 



(i) APPLICANT: ANDERSON, DARRELL R. 

HANNA, NABIL 



LEONARD , JOHN E. 
NEWMAN , ROLAND A, 
REFF, MITCHELL E. 



RASTETTER, WILLIAM H. 



(ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF CHIMERIC AND 
RADIOLABELED ANTIBODIES TO HUMAN B LYMPHOCYTE RESTRICTED 
DIFFERENTIATION ANTIGEN FOR THE TREATMENT OF B CELL 
LYMPHOMA 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PILLSBURY WINTHROP 

(B) STREET: 1100 New York Avenue, N.W. , Ninth FL. 

( C ) CITY : Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/921,060 

(B) FILING DATE: 2 9 -AUG- 1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Teskin, Robin L. 

(B) REGISTRATION NUMBER: 35,030 

(C) REFERENCE/DOCKET NUMBER: 03 7003-0275463 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-861-3000 

(B) TELEFAX: 202-822-0944 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 




(iii) 



NUMBER OF SEQUENCES: 11 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GGGAGCTTGG ATCGATCCTC TATGGTT 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 18 0 
ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 24 0 
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 3 00 
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 
ACGTCAATAA TGACGTATGT TCC CATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 54 0 
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACAT CTACG TATTAGTCAT CGCTATTACC 720 
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 78 0 
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 84 0 
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 
CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 
CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 102 0 
CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 
TCTGTCTTCA TCTTCCCGCC AT CTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 114 0 
TGCCTGCTGA ATAACTTCTA TCC C AG AG AG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 
CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 12 60 
AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 132 0 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 13 80 
TGTTGAATTC AGATCCGTTA ACGGTTAC C A ACTACCTAGA CTGGATTCGT GACAACATGC 144 0 
GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 15 60 
TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 162 0 
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1680 
GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 174 0 
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 1800 
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT T AC CATGGTG ATGCGGTTTT 1860 
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 192 0 
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 198 0 
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2 04 0 
TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2100 
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC 2160 
AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 222 0 
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GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 22 80 
GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 23 4 0 
TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 24 00 
AACGTGAATC ACAAGCC CAG CAACACCAAG GTGGACAAGA AAGCAGAGCC CAAATCTTGT 24 60 
GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 252 0 
TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGT CAC A 2 580 
TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 2640 
GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 2700 
CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGACTACAAG 2760 
TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 2 82 0 
GGGCAGCCCC GAGAACCACA GGTGT AC AC C CTGCCCCCAT CCCGGGATGA GCTGACCAGG 2880 
AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 2 94 0 
TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 3 000 
GACGGCTCCT TCTTCCTCTA CAGCAAGCTC AC CGTGG AC A AGAGCAGGTG GCAGCAGGGG 3 060 
AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 312 0 
CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT 3180 
TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT 324 0 
GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 33 00 
CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGT C ATT 3360 
CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 3420 
GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT 34 80 
CCCGATCCCC AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3 54 0 
AATTTTAACA C CAATT CAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3 600 
AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3 66 0 
GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3 72 0 
GCCTGATTCC GTAGAGCCAC AC CTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 3 78 0 
GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC AC ATTTG CTT 3 84 0 
CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3 900 
CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3 960 
GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4 02 0 
GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4 08 0 
TCTTCAGTGG AAGGTAAACA GAAT CTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 414 0 
CCTGAGAACA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 42 0 0 
GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATG C CTT AAGACTTATT 42 60 
GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGT CGGAGG CAGTTCTGTT 4 32 0 
TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 4380 
GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 444 0 
GAATAC CCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 45 00 
GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 
TCATGCATTT TTATAAGAC C ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4 62 0 
TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 468 0 
TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 4 74 0 
GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4 800 
CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 486 0 
TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4 92 0 
AC CAATT CAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4 98 0 
TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 504 0 
GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 
CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 
AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 522 0 
TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 528 0 
TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 534 0 
CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 54 00 
CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 54 60 
CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 5520 
GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 558 0 
GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5 64 0 
CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 5700 
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TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 5760 

CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 5820 

CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 

GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACC CGTGATA TTGCTGAAGA 5940 

GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 60 00 

GCAGCGCATC GCCTTCTATC GCCTTCTTGA CGAGTTCTTC TGAGCGGGAC TCTGGGGTTC 6060 

GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC 6120 

TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG 6180 

CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT 624 0 

GGTTACAAAT AAAGCAATAG CAT C AC AAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 63 00 

TCTAGTTGTG GTTTGTCCAA ACT CAT CAAT CTATCTTATC ATGTCTGGAT CGCGGCCGCG 63 60 

ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 64 2 0 

CGCTCACAAT TCCACACAAC ATACGAGCCG G AAG CAT AAA GTGTAAAGCC TGGGGTGCCT 64 80 

AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC C AGT CGGGAA 654 0 

ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 66 00 

TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 6660 

GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 672 0 

CAGGAAAGAA CATGTGAGCA AAAGGC CAGC AAAAGGC CAG GAACCGTAAA AAGGCCGCGT 6780 

TGCTGGCGTT TTTC CATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 684 0 

GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 69 00 

CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 6960 

CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 702 0 

TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 7080 

TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 714 0 

CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 72 00 

AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 72 60 

AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 73 2 0 

GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCT CAAG 73 80 

AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 744 0 

GGATTTTGGT CATGAGATTA T C AAAAAGG A TCTTCACCTA GATCCTTTTA AATTAAAAAT 7 500 

GAAGTTTTAA AT CAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 7 560 

TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7 62 0 

TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7 680 

TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 7 74 0 

GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 78 00 

GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 78 60 

TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7 92 0 

CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7 980 

TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 804 0 

CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100 

AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160 

CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC AT CATTGGAA 8220 

AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTAC CGCT GTTGAGATCC AGTTCGATGT 82 80 

AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTT CTGGGT 834 0 

GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 84 00 

GAATACTCAT ACTCTTCCTT TTT CAAT ATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 84 60 

TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 8520 

TTCCCCGAAA AGTGCCACCT 854 0 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT T AGTCAGC C A TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 48 0 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 54 0 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 66 0 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT ACCGGTTTGA CTCACGCGGA 780 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 84 0 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 102 0 

GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080 

GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 114 0 

CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 

CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 1260 

ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 132 0 

AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 1380 

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1440 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1500 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1560 

AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 162 0 

TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1680 

TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 174 0 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1800 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC I860 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 192 0 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1980 

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 204 0 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 2100 

TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 2160 

GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 222 0 

CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 2280 

GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 234 0 

TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2400 

ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 2460 

GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 2520 

TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 2580 

GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 264 0 

CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGCCTACATG 2700 

CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 2760 

TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCT CTGCA 2820 

GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 2880 

GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 2940 

TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 3000 

GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 3060 

TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC 312 0 



AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 3180 

CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 324 0 

GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 3300 

TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 3360 

AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 3420 

GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 34 80 

AAAGC CAAAG GGCAGCCCCG AGAAC CAC AG GTGTACACCC TGCCCCCATC CCGGGATGAG 354 0 

CTGACCAAGA AC C AGGTC AG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC 3 600 

GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG 3660 

CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 3720 

CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGGACAA CCACTACACG 3 780 

CAGAAGAGCC TCTCCCTGTC T C CGGGT AAA TGAGGATCCG TTAACGGTTA CCAACTACCT 3 84 0 

AGACTGGATT CGTGACAACA TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC 3 900 

CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 3 960 

GTGCCACTCC CACTGTCCTT T C CTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 4 02 0 

GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 4 08 0 

ACAATAGCAG GCATGCTGGG GATGCGGTGG GCT CTATGGA ACCAGCTGGG GCTCGACAGC 414 0 

GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 42 00 

AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 42 6 0 

ATGCTTTAGA G AC AGTGTT C TCTG CACAGA TAAGGACAAA C ATT ATT C AG AGGGAGTACC 432 0 

CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 43 8 0 

ATCACCGAAG CCTGATTCCG TAGAGC CACA CCTTGGTAAG GGCCAATCTG CT C AC AC AGG 444 0 

ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 4 500 

CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG 4 560 

ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 4 62 0 

GCCATCATGG TTCGACCATT GAACTGCATC GTCGCCGTGT CCCAAAATAT GGGGATTGGC 46 80 

AAGAACGGAG ACCTACCCTG GCCTCCGCTC AGGAACGAGT TCAAGTACTT CCAAAGAATG 4 74 0 

ACCACAACCT CTTCAGTGGA AGGTAAACAG AATCTGGTGA TTATGGGTAG GAAAACCTGG 4800 

TTCTCCATTC CTGAGAAGAA TCGACCTTTA AAGGACAGAA TTAATATAGT TCTCAGTAGA 4860 

GAACTCAAAG AACCACCACG AGGAGCTCAT TTTCTTGCCA AAAGTTTGGA TGATGCCTTA 4 92 0 

AGACTTATTG AACAAC CGGA ATTGGCAAGT AAAGTAGACA TGGTTTGGAT AGTCGGAGGC 4 980 

AGTTCTGTTT ACCAGGAAGC CATGAATCAA CCAGGCCACC TTAGACTCTT TGTGACAAGG 5 04 0 

ATCATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA 5100 

CTTCTCCCAG AATACC CAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT 516 0 

AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTTCAAGTT CTCTGCTCCC 522 0 

CTCCTAAAGC TATGCATTTT TATAAGACCA TGGGACTTTT GCTGGCTTTA GATCAGCCTC 52 80 

GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 534 0 

CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG 54 00 

TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 54 6 0 

TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGAAC CAGCTGGGGC 5520 

TCGAGCTACT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 558 0 

AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 564 0 

AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 57 00 

GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CAT CAC CGAA 57 6 0 

GCCTGATTCC GTAGAGCCAC AC CTTGGT AA GGGCCAATCT GCTCACACAG GATAGAGAGG 582 0 

GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 588 0 

CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTCTATG GTTGAACAAG ATGGATTGCA 594 0 

CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC 600 0 

AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC CGGTTCTTTT 6060 

TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC 6120 

GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 6180 

AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 624 0 

TCCTGCCGAG AAAGTAT CCA TCATGGCTGA TGCAATGCGG CGGCTGCATA CGCTTGATCC 63 00 

GGCTACCTGC CCATTCGACC AC CAAGCGAA ACATCGCATC GAGCGAGCAC GT ACT CGG AT 6360 

GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CAT CAGGGGC TCGCGCCAGC 642 0 

CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG TCGTGACCCA 64 80 

TGGCGATGCC TGCTTGCCGA AT AT C ATGGT GGAAAATGGC CGCTTTTCTG GATTCATCGA 654 0 

CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA C C CGTGAT AT 6600 



TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG GTATCGCCGC 
SSSJSS CAGCGCATCG CCTTCTATCG CCTTCTTGAC OAGTTCTTCT GAGCGGGACT 
TCQ aaaTGAC CGA CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC 
ScSSS? otISaaao GTTGGGCTTC GGAATCGTTT TCCGGGACGC CGGCTGGATG 

atcScSS gcgSggatct catgctggag ttcttcgccc accccaactt gtttattgca 

GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT 

?SctcSS ctagttgtgg tttgtccaaa ctcatcaatc tatcttatca tgtctggatc 
gcSgSSgI tcccgtcgag agcttggcgt aatcatggtc atagctgttt cctgtgtgaa 
JSgSSS gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 
g*Sg™5a Sgagtcagc taactcacat taattgcgtt gcgctcactg cccgctttcc 
Jg?SggaU Stgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 
otSSc^ Sggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 
ggctcSSS agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 
gggSScgS aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 
aSSgIS?? Sctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 
gSScSg ?cagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 
SgSSgSc cScgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 
SSgaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt 
cStcSggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 

cSSSS? ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 
caSggcagc agccStggt AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 

SSSSSS otgSggcct aactacggct acactagaag gacagtattt ggtatctgcg 
ctSSSSS gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 
cSSgSS Sgcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 
gSScSS agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 
SSSSJS JSStgotc atgagattat caaaaaggat cttcacctag atccttttaa 

ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 

SSSS? aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 
?£S£SaS ^ccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 
SSSgSS gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc 
aSSgcSg aSgccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 
Sttaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 
SSSS?SS ScScaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca 
^tJSgttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 
?SgSccS cgg^cScS atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 
SotStcgc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg 
SJSSctS otactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct 
cSSSSc* SSSSSgg gataataccg cgccacatag cagaacttta aaagtgctca 
tca?tcgaaJ acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 

SJSaSSa" ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 

?ScSgg?g agcSaIaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 

rrAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 

SracTcl? gSSSac atatttgaat gtatttagaa aaataaacaa ATAGGGGTTC 

CGCGCACATT TCCCCGAAAA GTGCCACCT 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ATCACAGAT C TCTCACCATG GATTTTCAGG TGCAGATTAT CAGCTTC 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iv) ANT I- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TGCAGCATCC - GTACGTTTGA TTTCCAGCTT 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .384 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA 4 8 

Met Asp Phe Gin Val Gin He He Ser Phe Leu Leu He Ser Ala Ser 
15 10 15 

GTC ATA ATG TCC AGA GGG CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC 96 
Val He Met Ser Arg Gly Gin He Val Leu Ser Gin Ser Pro Ala He 
20 25 30 

CTG TCT GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC 144 
Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala Ser 
35 40 45 

TCA AGT GTA AGT TAC ATC CAC TGG TTC CAG CAG AAG CCA GGA TCC TCC 192 
Ser Ser Val Ser Tyr He His Trp Phe Gin Gin Lys Pro Gly Ser Ser 
50 55 60 



CCC AAA CCC TGG ATT TAT GCC ACA TCC AAC CTG GCT TCT GGA GTC CCT 24 0 

Pro Lys Pro Trp lie Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 75 80 

GTT CGC TTC AGT GGC AGT GGG TCT GGG ACT TCT TAC TCT CTC ACA ATC 28 8 

Val Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr lie 
85 90 95 

AGC AGA GTG GAG GCT GAA GAT GCT GCC ACT TAT TAC TGC CAG CAG TGG 33 6 

Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp 
100 105 110 

ACT AGT AAC CCA CCC ACG TTC GGA GGG GGG ACC AAG CTG GAA ATC AAA 3 84 

Thr Ser Asn Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
115 120 125 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Asp Phe Gin Val Gin He He Ser Phe Leu Leu lie Ser Ala Ser 
15 10 15 

Val He Met Ser Arg Gly Gin He Val Leu Ser Gin Ser Pro Ala He 
20 25 30 

Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala Ser 
35 40 45 

Ser Ser Val Ser Tyr He His Trp Phe Gin Gin Lys Pro Gly Ser Ser 
50 55 60 

Pro Lys Pro Trp He Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro 
65 70 75 80 

Val Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr He 
85 90 95 

Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gin Gin Trp 
100 105 110 

Thr Ser Asn Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) * TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GCGGCTCCCA CGCGTGTCCT GTCCCAG 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iv) ANT I- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GGSTGTTGTG CTAGCTGMRG AGACRGTGA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .420 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT 

Met Gly Trp Ser Leu lie Leu Leu Phe Leu Val Ala Val Ala Thr Arg 

15 10 15 

GTC CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG 

Val Leu Ser Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys 

20 25 30 
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CCT GGG GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT 144 
Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe 
35 40 45 

ACC AGT TAC AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG 192 
Thr Ser Tyr Asn Met His Trp Val Lys Gin Thr Pro Gly Arg Gly Leu 
50 55 60 

GAA TGG ATT GGA GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT 24 0 

Glu Trp He Gly Ala He Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn 
65 70 75 80 

CAG AAG TTC AAA GGC AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC 288 
Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 
85 90 95 

ACA GCC TAC ATG CAG CTC AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC 336 
Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
100 105 HO 

TAT TAC TGT GCA AGA TCG ACT TAC TAC GGC GGT GAC TGG TAC TTC AAT 3 84 

Tyr Tyr Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn 
115 120 125 

GTC TGG GGC GCA GGG ACC ACG GTC ACC GTC TCT GCA 42 0 

Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ala 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Gly Trp Ser Leu lie Leu Leu Phe Leu Val Ala Val Ala Thr Arg 
15 10 15 

Val Leu Ser Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys 
20 25 30 

Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe 
35 40 45 

Thr Ser Tyr Asn Met His Trp Val Lys Gin Thr Pro Gly Arg Gly Leu 
50 55 60 

Glu Trp He Gly Ala He Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn 
65 70 75 80 



Gin Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser 
85 90 95 
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Thr Ala Tyr Met Gin Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val 
100 105 HO 

Tyr Tyr Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn 
115 120 125 

Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ala 
130 135 140 



